Prediction error in partial least squares regression: a critique on the deviation used in The Unscrambler
نویسنده
چکیده
Partial least squares (PLS) regression is commonly used for multivariate calibration of instruments. Because of the need to know the quality of the prediction in a specific unknown sample and the lack of theory, an 'empirically found formula' to express the uncertainty is utilized in The Unscrambler II software, the de-facto standard in computer software for PLS. In this critique the formula is examined theoretically and by simulation. It is concluded that this formula underestimates the root mean squared error of prediction in most practical applications of PLS. A change of the formula is planned in the next version of The Unscrambler. In the mean time users of The Unscrambler ver 5.5 or lower should multiply the reported deviation by a factor of at least 1/2(1 (A + 1 ) / n ) , to get a reasonable estimate of the prediction error.
منابع مشابه
RICE QUALITY AND PROCESSING Apparent Amylose Content Prediction using Near Infrared Spectroscopy of Individual and Bulk Rice Kernels
For this study seventy-eight rice samples from 2008 of mediumand long-grain rice cultivars were harvested and selected from Rohwer, Keiser, and Corning locations in Arkansas. Bulk milled rice samples of ~11.5 grams and 100 individual milled rice kernels fitted with single kernel adapter were scanned using a diode-array-analyzer of each cultivar/location. Partial least squares regression (PLS) w...
متن کاملDetermination of Protein and Moisture in Fishmeal by Near-Infrared Reflectance Spectroscopy and Multivariate Regression Based on Partial Least Squares
The potential of Near Infrared Reflectance Spectroscopy (NIRS) as a fast method to predict the Crude Protein (CP) and Moisture (M) content in fishmeal by scanning spectra between 1000 and 2500 nm using multivariate regression technique based on Partial Least Squares (PLS) was evaluated. The coefficient of determination in calibration (R2C) and Standard Error of Calibra...
متن کاملLeast-squares support vector machine and its application in the simultaneous quantitative spectrophotometric determination of pharmaceutical ternary mixture
This paper proposes the least-squares support vector machine (LS-SVM) as an intelligent method applied on absorption spectra for the simultaneous determination of paracetamol (PCT), caffeine (CAF) and ibuprofen (IB) in Novafen. The signal to noise ratio (S/N) increased. Also, In the LS - SVM model, Kernel parameter (σ2) and capacity factor (C) were optimized. Excellent prediction was shown usin...
متن کاملSimplex design method in simultaneous spectrophotometric determination of silicate and phosphate in boiler water of power plant and sewage sample by partial least squares
Partial least squares modeling as a powerful multivariate statistical tool was applied tothe simultaneous spectrophotometric determination of silicate and phosphate in aqueoussolutions. The concentration range for silicate and phosphate were 0.02-0.6 and 0.4-3 μg ml-1,respectively. The experimental calibration set was composed with 30 sample solutions using amixture design for two component mix...
متن کاملSimultaneous Spectrophotometric Determination of Iron, Cobalt and Copper by Partial Least-Squares Calibration Method in Micellar Medium
Iron, cobalt and copper are metals, which appear together in many real samples, both natural and artificial. Recently a classical univariate micellar colorimetric method has been developed for determination of these metal ions. The organized molecular assemblies such as micelles are used in spectroscopic measurements due to their possible effects on the systems of interest. The ability of mi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1994